A proposal for improving WordNet Domains

Authors

  • Aitor Gonzalez-Agirre
  • Mauro Castillo
  • German Rigau
Abstract

WordNet Domains (WND) is a lexical resource in which synsets have been semi-automatically annotated with one or more domain labels from a set of 165 hierarchically organized domains. One use of WND is to reduce the degree of polysemy of words by grouping the senses that belong to the same domain. However, the semi-automatic method used to develop this resource was far from perfect. Cross-checking the content of the Multilingual Central Repository (MCR) reveals some errors and inconsistencies. Many are very subtle; others leave no doubt. Moreover, it is very difficult to quantify the number of errors in the original version of WND. This paper presents a novel semi-automatic method for propagating domain information through the MCR. We also compare both labellings (the original and the new one), which allows us to detect anomalies in the original WND labels.
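
Two ideas from the abstract, grouping the senses of a word by domain and comparing two labellings to flag possible anomalies, can be illustrated with a minimal Python sketch. The sense identifiers, domain labels, and function names are hypothetical and not taken from WND or the MCR, and the disagreement criterion (no shared label) is an assumption for illustration, not the method of the paper.

```python
from collections import defaultdict

# Hypothetical toy data: senses of one word mapped to sets of domain labels.
# Identifiers and labels are illustrative only, not real WND annotations.
ORIGINAL_LABELS = {
    "bank#1": {"economy"},
    "bank#2": {"geography"},
    "bank#3": {"economy"},
    "bank#4": {"factotum"},
}

# Hypothetical second labelling, standing in for a newly propagated one.
NEW_LABELS = {
    "bank#1": {"economy"},
    "bank#2": {"geography"},
    "bank#3": {"economy", "banking"},
    "bank#4": {"architecture"},
}


def group_senses_by_domain(sense_domains):
    """Group sense identifiers under each domain label they carry."""
    groups = defaultdict(set)
    for sense, domains in sense_domains.items():
        for domain in domains:
            groups[domain].add(sense)
    return dict(groups)


def find_label_disagreements(original, new):
    """Return senses present in both labellings whose label sets share no label."""
    disagreements = {}
    for sense in original.keys() & new.keys():
        if not (original[sense] & new[sense]):
            disagreements[sense] = (original[sense], new[sense])
    return disagreements


if __name__ == "__main__":
    groups = group_senses_by_domain(ORIGINAL_LABELS)
    # Four senses collapse into three domain groups in this toy example.
    print(len(ORIGINAL_LABELS), "senses ->", len(groups), "domain groups")
    print(find_label_disagreements(ORIGINAL_LABELS, NEW_LABELS))
```

In this toy data only the sense whose two label sets are disjoint is flagged; a sense that merely gains an extra label is not, which is one of several possible ways to define an anomaly.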

Similar resources

Enriching the Integration of Semantic Resources based on WordNet

In this paper we present the enrichment of the Integration of Semantic Resources based on WordNet (ISR-WN Enriched). This new proposal improves on the previous one, in which several semantic resources such as SUMO, WordNet Domains and WordNet Affects were related, by adding other semantic resources such as Semantic Classes and SentiWordNet. Firstly, the paper describes the architecture of this proposal e...

WordNet and Automated Text Summarization

Proposals for text classification and information retrieval have been recently presented making use of the WordNet ontology. Generally, this methodology requires statistical induction of synset clusters and entails costly training of specific key domains. The present proposal intends to show that a simple recursive evaluation procedure and WordNet are rich enough to obtain useful results in tex...

A graph-Based Approach to WSD Using Relevant Semantic Trees and N-Cliques Model

In this paper we propose a new graph-based approach to solve semantic ambiguity using a semantic net based on WordNet. Our proposal uses an adaptation of the Clique Partitioning Technique to extract sets of strongly related senses. For that, an initial graph is obtained from senses of WordNet combined with the information of several semantic categories from different resources: WordNet Domains,...

Enhancement Electronic evaluation for Semantic Arabic Oral Exam

The importance of knowledge in speech underlines the importance of the oral exam. In this paper we integrate BOW (Bag of Words), LSA (Latent Semantic Analysis), ASR (automatic speech recognition), zero crossing rate, and an ontology-based approach to automate the online oral exam, especially in the Arabic language, taking the authentication problem into consideration. Our proposed method faced ma...

Improving Semantic Knowledge Base for Transfer Learning in Sentiment Analysis

Sentiment analysis deals with the computational treatment of opinion, sentiment, and subjectivity in text, and has attracted a great deal of attention. It has been widely used across a range of domains in recent years, such as information retrieval, question answering systems and social networks. This paper presents a new method for improving the semantic knowledge base for sent...

Publication date: 2012